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The iota toxin which is produced by Clostridium perfringens type E, is a binary toxin consisting of two 
independent polypeptides: la, which is an ADP-ribosyltransferase, and lb, which is involved in the binding and 
internalization of the toxin into the cell. Two degenerate oligonucleotide probes deduced from partial amino 
acid sequence of each component of C. spiroforme toxin, which is closely related to the iota toxin, were used to 
clone three overlapping DNA fragments containing the iota-toxin genes from C. perfringens type E plasmid 
DNA. Two genes, in the same orientation, coding for la (387 amino acids) and lb (875 amino acids) and 
separated by 243 noncoding nucleotides were identified. A predicted signal peptide was found for each 
component, and the secreted lb displays two domains, the propeptide (172 amino acids) and the mature protein 
(664 amino acids). The la gene has been expressed in Escherichia coli and C. perfringens, under the control of 
its own promoter. The recombinant polypeptide obtained was recognized by la antibodies and ADP-ribosylated 
actin. The expression of the lb gene was obtained in E. coli harboring a recombinant plasmid encompassing the 
putative promoter upstream of the la gene and the la and lb genes. Two residues which have been found to be 
involved in the NAD+ binding site of diphtheria and pseudomonas toxins are conserved in the predicted la 
sequence (Glu-14 and Trp-19). The predicted amino acid lb sequence shows 33.9% identity with and 54.4% 
similarity to the protective antigen of the anthrax toxin complex. In particular, the central region of lb, which 
contains a predicted transmembrane segment (Leu-292 to Ser-308), presents 45% identity with the correspond- 
ing protective antigen sequence which is involved in the translocation of the toxin across the cell membrane. 



Clostridium perfringens is a ubiquitous bacteria which 
causes food-borne illness and gas gangrene in humans and 
digestive diseases in animals. This species is divided into five 
toxin types on the basis of the production of four major lethal 
toxins (alpha, beta, epsilon, and iota) (6). C. perfringens type 
E produces the iota toxin and has been implicated in calf and 
lamb enterotoxemias (6). 

Iota toxin is a binary toxin which is composed of two 
independent polypeptide chains called iota a (la) (M c , 
47,500) and iota b (lb) (M r 71,500), which are not associated 
by either covalent or noncovalent bonds (26, 27). It has been 
shown that the la light chain causes ADP-ribosylation of 
globular skeletal muscle and nonmuscle actin at Arg-177 (23, 
28). The heavy-chain lb is required for penetration of la into 
the cytosol, and lb must undergo limited proteolysis to be 
functionally active (3). 

Iota toxin shares a comparable structure and mode of 
action with both Clostridium spiroforme toxin and C2 toxin 
of Clostridium botulinum types C and D (3). However, iota 
toxin is antigenically related to C. spiroforme toxin but not 
to C. botulinum C2 toxin (16, 19). An ADP-ribosyltrans- 
ferase antigenically related to the enzymatic component of 
iota and C. spiroforme toxins was found in one Clostridium 
difficile strain (18). Iota and C2 toxins, like leucocidin (15), 
Staphylococcus gamma lysin (5), and anthrax (13) toxins, 
belong to the binary toxins formed of independent enzymatic 
and binding components. Therefore, this toxin family differs 
from the classical A-B toxins, such as the cholera and related 
toxins, which consist of a subunit or domain (A), with a 
specific function, and a binding domain or subunit(s) (B), and 
are assembled in a defined structure (25). In contrast, the 
protective antigen (PA) of the anthrax toxin binds to a cell 
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surface receptor, is cleaved by a cell surface protease, and 
then is able to bind the enzymatic component (edema factor 
or lethal factor) (13). 

Little is known about the genetics of the Clostridium 
binary toxins. In the present study, we report the character- 
ization of the iota-toxin genes and the relationship with 
anthrax toxin genes. We also studied their expression in 
recombinant Escherichia coli and C. perfringens. 



MATERIALS AND METHODS 

C. spiroforme toxin protein sequencing. Light (Sa) and 
heavy chains (Sb) of C. spiroforme toxin were purified from 
C. spiroforme NCTC 11493 as previously described (19). 
Purified toxin chains were run on a 10% polyacrylamide gel 
containing 0.1% sodium dodecyl sulfate (SDS) and trans- 
ferred to a polyvinylidene difluoride membrane (Immobilon; 
Millipore, Paris, France). After Coomassie blue staining and 
destaining, the protein bands were cut out and digested by 
trypsin. The peptides were separated by high-performance 
liquid chromatography, and one internal peptide of each 
toxin chain was microsequenced with a gas-phase protein 
sequencer (Applied Biosystems). 

Bacterial DNA and plasmids. C. spiroforme NCTC 11493 
and Cs246, and C. perfringens type E strain NCIB 10748 
were grown in broth containing Trypticase (30 g/liter), yeast 
extract (20 g/liter), glucose (5 g/liter), and cysteine-HCl (0.5 
g/liter) (pH 7.2) under anaerobic conditions. Clostridium 
genomic and plasmid DNAs were extracted and purified as 
previously described (17). 

Plasmid pUC19 (Appligene, Strasbourg, France) and pA- 
CYC184 (New England Biolabs, Ozyme, Paris, France) 
were used for cloning in E. coli TGI (Mac-pro) thi supE 
hsdD5 F' (traD36proAB + AlacZM15), and pJIR418 (24) was 
used for expression in C. perfringens S13 (24). 
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Probes and hybridization conditions. Oligonucleotides 
were synthesized by the phosphoramidite method with a 
Cyclone Milligen automated DNA synthesizer. Genescreen 
Plus filters (New England Nuclear Research Products, du 
Pont Nemours, Paris, France) were pretreated with 200 u.g 
of heat-denaturated salmon sperm DNA per ml in 1 M 
NaCl-10% dextran sulfate-0.5% SDS-50 mM Tris-HCl (pH 
7.5) at 40°C and then treated with a 5'- 32 P-labeled oligonu- 
cleotide (10 6 cpm/ml) in the same mixture overnight at 40°C. 
Filters were washed in 6x SSC (lx SSC is 0.15 M NaCl plus 
0.015 M sodium citrate)-0.1% SDS at 40°C for 2 h and 
exposed overnight to Fujii RX films. 

The recombinant DNAs used as probes were labeled by 
the random method with the Multiprime kit (Amersham, 
Paris, France). The hybridizations were carried out with the 
Rapid Hybridization Buffer (Amersham) at 65°C for 4 h. The 
filters were washed in 0.1 x SSC-0.1% SDS at 65°C for 1 h. 

PCR amplification. One hundred nanograms of DNA was 
amplified by the polymerase chain reaction (PCR) in a total 
volume of 100 p.1 of 10 mM Tris-HCl (pH 8.3)-50 mM KC1-4 
mM MgCl 2 -0.1% bovine serum albumin-100 \xM deoxynu- 
cleoside triphosphate-10 mM p-mercaptoethanol-50 pmol of 
each primer-2.5 U of Taq polymerase (Amersham). Reac- 
tion mixtures were denatured at 95°C for 2 min and then 
subjected to 30 cycles of denaturation (20 s at 94°C), anneal- 
ing (20 s at 5°C below the theoretical melting temperature of 
the primers), and extension (20 s at 72°C) in a Prem III 
thermal cycler (Flobio, Paris, France). 

Other molecular biology techniques. Ligation and prepara- 
tion of plasmid DNA from E. coli were conducted as 
described by Maniatis et al. (14). Bacteria were transformed 
by electroporation (17). C. perfringens electroporation was 
done according to the method of Scott and Rood (22). T4 
polynucleotide kinase and calf intestinal phosphatase were 
obtained from Boehringer-Mannheim France, other en- 
zymes were obtained from Pharmacia (Paris, France), and 
the Erase-a-base was obtained from Promega (Coger, Paris, 
France). DNA was sequenced by the dideoxy chain termi- 
nator procedure with a Sequenase kit (United States Bio- 
chemical Corp., Cleveland, Ohio). 

Potential biohazards associated with the experiments de- 
scribed had been approved by the French National Control 
Committee. 

Expression of recombinant proteins. E. coli TGI trans- 
formed with recombinant plasmid pUC19 or pACYC184 was 
grown in LB medium (14)-50 ng of ampicillin per ml to an 
optical density at 600 nm of 1. One milliliter of the cell 
suspension was pelleted in a microcentrifuge, resuspended 
in 100 juul of 20 mM HEPES (AT-2-hydroxyethylpiperazine- 
A/^'-2-ethanesulfonic acid; pH 7.5), and sonicated. One to 20 
ul was mixed with 5 ul of Laemmli sample buffer, boiled for 
3 min, and loaded onto a 0.1% SDS-12% polyacrylamide gel 
(12). 

For immunobloting analysis, proteins were transferred 
electrophoretically to nitrocellulose (Hybond C; Amer- 
sham). The nitrocellulose was first incubated for 1 h in 
phosphate-buffered saline containing 5% milk and then incu- 
bated overnight at room temperature with 1:400 dilution of 
immunopurified la or Sb (18, 19) rabbit antibodies. Bound 
antibodies were detected with peroxidase-labeled protein A 
and the chemiluminescence kit provided by Amersham. 

Gel assay for ADP-ribosylation. In vitro ADP-ribosylation 
assay were performed with G actin isolated from Xenopus 
laevis oocytes as described previously (18). The mixture for 
the polyacrylamide gel electrophoresis (PAGE) assay (total 
volume, 20 ul) contained 7 |xg of oocyte actin, 100 mM 



Sb 256 Y L S S Y L E S N T A 
P131 5'-TAT TTI TCI TCI TAT TTI GAA TCI AAT ACI GCI 
b TAT GTA TCA AGT TAT TTA GAA TCA AAT ACC GCT 

Sb G D P Y 270 

P131 GGI GAT CCI TAT -3' 



Sa 127 DKPMYVYYFES 
P222 5' -GAT AAA CCI ATG TAT GTI TAT TAT TTT GAA TCI 
b GAT AAA CCT ATA AAT GTT TAT TAT TTT GAG TCT 

Sa P E 139 

P222 CCI GAA- 3 1 

FIG. 1. Amino acid sequences of internal peptides of C. spiro- 
forme toxin components Sb and Sa and oligonucleotide probes P131 
and P222 complementaiy to these sequences, respectively, accord- 
ing to the Clostridium codon usage (see the text). The nucleotide 
sequences determined by DNA sequencing of C. perfringens type E 
(NCIB 10748) are shown in lines b. The amino acids are numbered 
according to the iota-toxin sequence. 



HEPES, 5 mM ATP, 2.5 mM ADP-ribose, and 5 x 10 5 cpm 
of 32 P-NAD (specific activity, 30 Ci/mmol; Dupont, NEN 
Research Products, Boston, Mass.). Between 1 and 10 \d of 
a fraction to be tested for enzymatic activity was added. 
After incubation for 1 h at 37°C, sample buffer was added 
and the preparation was fractionated by SDS-PAGE and 
processed by autoradiography. 

Purification of the recombinant la. C. perfringens S13 
harboring pMRP76 was grown in TGY under anaerobic 
conditions to an optical density at 600 nm of 1. Culture 
supernatant (1 liter) was precipitated by ammonium sulfate 
(70% saturation). The precipitate was dissolved in 30 ml of 
10 mM Tris (pH 7.5), dialyzed, and applied to a column (2 by 
15 cm) containing DEAE-Sepharose CL6B (Pharmacia) 
equilibrated with 10 bed volumes of the same buffer. The gel 
was washed and eluted with 0.1 M NaCl in the same buffer. 
The eluate was purified on an immunoaffinity column with 
rabbit antibodies against C. perfringens toxin component la 
as previously described (19). The column was eluted with 4 
M guanidine hydrochloride (Sigma) in 10 mM Tris (pH 7.5). 
The eluted fractions were dialyzed against the Tris buffer. 

Nucleotide sequence accession number. The nucleotide 
sequences reported in this paper have been submitted to the 
EMBL Data Library with accession number X73562. 

RESULTS 

Cloning of the iota-toxin genes. Several different peptides 
of C. spiroforme toxin chains Sa and Sb were microse- 
quenced. Two oligonucleotides (P131 and P222), deduced 
from internal peptide sequences of Sb and Sa, respectively, 
were synthesized according to the Clostridium codon usage 
(31) and with inosine at the most degenerated positions (Fig. 
1). These probes hybridized with total and plasmid DNA of 
C. spiroforme NCTC 11493 and Cs246 and C. perfringens 
NCIB 10748 (data not shown), indicating that iota-toxin 
genes are localized on plasmid DNA of these Clostridium 
strains. Since DNA was easier to prepare from C. perfrin- 
gens than from C. spiroforme, we chose to clone iota-toxin 
genes from C. perfringens NCIB 10748. 

The 2-kb iftndlll DNA fragment of C. perfringens NCIB 
10748 plasmid DNA recognized by P131 was cloned into 



Vol. 61, 1993 



CHARACTERIZATION OF C. PERFRINGENS IOTA-TOXIN GENES 5149 



H — I— H- 



H B 

— H 

H B 

H 



I 1 — H > MRP82 



pUC19 (pMRP44). The synthetic oligonucleotide P190 com- 
plementary to the 5' extremity of the 2-kb Hindlll DNA 
fragment (nucleotides 3288 to 3308) was used to clone the 
3.8-kbfic/I overlapping DNA fragment (pMRP67) (Fig. 2). A 
third clone (pMRP82), which contains the 1.2-kb £coRI 
DNA fragment overlapping with the 3' extremity of the 
insert of pMR44, was obtained by using the primer P238 
complementary to nucleotides 5141 to 5161. 

The primer P222, deduced from the sequence of an inter- 
nal peptide of Sa, hybridized with C. spiroforme and C. 
perfiingens type E DNAs and also with pMRP67, suggesting 
that the la gene is in the proximity of the lb gene. 

The overlapping DNA fragments cloned in pMRP44, 
pMRP67, and pMRP82 were sequenced (Fig. 3). The DNA 
fragment cloned in pMRP44 was found to contain two 
internal Hindlll sites, indicating a partial digestion of the 
DNA used for cloning or a rearrangement of the cloned 
Hindlll DNA fragments. One of the internal Hindlll sites 
was localized on the overlapping region with pMRP67 (Fig. 
2 and 3). Two primers were synthesized on each part of the 
second Hindlll site and in the opposite direction from P231 
(positions 4638 to 4618) and P212 (positions 4225 to 4241) 
(Fig. 3). The PCR amplification products with these primers, 
C. perfiingens DNA, and pMRP44 were the same size (data 
not shown). These results showed that the two internal 
Mndlll sites in pMRP44 are the result of a partial DNA 
digestion. 

Features of iota-toxin genes. The 5,747 nucleotide sequence 
showed two open reading frames. The open reading frame 
near the 5' extremity and recognized by P222, which was 
deduced from the Sa internal peptide sequence, was as- 
signed to the la gene, and the downstream open reading 
frame, recognized by P131, was deduced from the internal 
Sb sequence and assigned to the lb gene. 

The la gene starts at the initiation codon ATG at position 
1465 and ends at the stop codon at position 2628. A consen- 
sus ribosome binding site, GGAGGG, is localized 6 nucle- 
otides upstream of the initiation codon. DNA stretches 
TATAAT and TTGTCAT, homologous to the -10 and -35 
Clostridium consensus promoter regions (31), are identified 
between positions 1304 and 1333 (Fig. 3). Three direct 
repeats and one inverted repeat are localized in the promoter 
region (Fig. 3). 



The lb gene, from the initiation codon ATG (position 2872) 
to the stop codon at position 5499, is preceded by a consen- 
sus ribosome binding site (GGAGG) between positions 2860 
and 2864. The lb gene is located on the same frame as the la 
gene. On the 243 noncoding nucleotides between the two 
genes, no promoter consensus sequence has been found. No 
long inverted repeats have been identified downstream of the 
la and lb genes. 

Analysis of the deduced amino acid sequences, (i) la se- 
quence. The deduced polypeptide is composed of 387 amino 
acids and has a predicted molecular mass of 45,121 Da. The 
41 N-terminal amino acids contained residues characteristic 
of a signal peptide (20): the presence of charged N-terminal 
residues at positions 2, 3, and 6 followed by a long hydro- 
phobic core, of a turn residue (Pro-23), and of a proteolytic 
cleavage site (Arg-41 to Ala-42). Furthermore, the amino 
acid sequence from Ala-42 to Glu-56 is in agreement with the 
sequence determined by protein sequencing of the N-termi- 
nal part of Sa (X-X-Ile-Glu-Pro-Pro-Pro-Asp-Phe-Leu-Lys- 
Lys-Lys-Glu). The discordant residues could be due to 
differences between Sa and la or to inaccuracies in protein 
sequencing (the presence of contaminating proteins or 
breakdown products which interfere with sequence analy- 
sis). However, the cleavage sites of the clostridial signal 
peptidases have not been fully investigated. It is not ex- 
cluded that the cleavage by signal peptidase occurs at 
Ala-42, or Ala-34 with a subsequent cleavage at Arg-41 to 
Ala-42 by clostridial peptidase, since Ala-X represents a 
common site of bacterial signal peptidases (29). 

The mature la protein is formed of 346 amino acids. The 
predicted molecular mass (40,392 Da) is comparable to that 
found for purified native Sa (44,000) and slightly lower than 
that of native la (47,500). The predicted pi of 5.01 agrees 
with the experimentally determined pi of la (5.2) and is 
lower than that of Sa (pi 6.2) (19, 26). The differences in the 
predicted and experimentally determined molecular masses 
of la could be due to a particular conformational structure. 

The la sequence did not show any significant similarity 
with other known protein sequences by FASTA and Blast 
programs. Domenighini et al. have identified conserved 
amino acids involved in the NAD+ binding site of ADP- 
ribosylating toxins (4). Recently, it has been shown that the 
peptide Phe-9 to Gly-19 of the ADP-ribosylating enzyme C3 
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£1 GGT La CAA GAA TTT La La ACT L C ACA Lt L T La Lt GAT TTT 
2 U5 AAT AAA ATA GAA AAT ATA GAT OCT L T AAA GAA La Lg GAA La AAA 




3,4 L T ATA CAA ATT aLa L T La Lt Lt Lt La La Lt La GCT Lg 
„ll ATA AGT CAA L G La TCA Lt AAT GTA TAT GCA La Lt L T La CAA 



taLLsLLLtLtcLaa 



La Lt Lt AAA Lt ATA TCT GTA L T La ATA TTA TAT TTA ATT TTA 

ACT AGT TCA TTT Lt AGT TAT L T TAT GCA CAA GAT TTA CAA ATA La 

AGC L T TAT ATT aLa GAT La GCT TTT ATT GAA AGA La GAA GAT TTT 

CTT AAA GAT AAA GAA AAT GCT ATT CAA TGG GAA AAA L G GAG GCT GAA 

La GTA GAA AAA Lc CTT GAT ACA CTT La AAA GAA GCA La GAA TTA 

TAT AAA AAA GAT TCT GAA La ATA AGT AAC TAC TCT CAG La AGA CAG 



AAT GTT TAT TAT TTT GAG TCT CCA GAG AAA TTT GCG TTT L T AAA GAA 
ATA AGA ACA La AAT La L T La ATT TCT TTA L G AAA TTT AAT GAG 



GAT GTT TCT TTA TAT GAA CCA GGT AAT GGC GAT GAA AAG CCT La La 
CTA CTT ATA CAT TTG La TTA CCA La AAT IcT GGT ATG La CCA TAT 



AAG ATA GAC AAA ATT GTT CGT La La ATA La GGA AAG CAA TAT ATA 

AAA GCT GAA GCT TCT ATT GTA AAC AGT CTT GAT TTT La GAT GAT GTA 

AGT AAA GGT GAT TTA TGG GGA AAA GAA AAT TAT AGT GAT TGG AGT AAT 

AAA TTA ACT CCT AAT GAA CTT GCT GAT GTA AAT Lc TAT ATG CGT La 

La TAT ACC GCA ATT AAT AAC TAT TTA ATA TCA AAT GGT CCT TTA AAT 

AAT CCT AAT OCA GAA CTA GAC TCT AAA GTA AAT AAC ATT GAA Lc GCA 
HI TTA AAG CTC ACA CCT ATT CCA TCT AAC TTA ATT GTA TAT AGA AGG TCT 



promoter sequences are underlined. The predicted 
and inverted repeats on the promoter region are ' 



M £ TTA La TTA ATG GCG CCA ATC La AAT GGT GAT TTG AAA TTT GAA La 

S lS AAG AAA GTA GAC L G CTT TTA ACT GAA GAT AAT TCA AGT ATA AAA TCT 

3 dl ATC CGA TGG ACA La Lg ATA ATT CCT TCT GAA GAT GGT GAA TAT ATA 
3 211 TTG TCA L T Lt AGA AAT GAT GTA TTA ATG La ATA AAT GCT AAA L G 
All L T ATT GCA AAA ACA CTT AAG L T Lt ATG AAA AAA GGT CAG GCA TAC 
3^ L T ATT La ATA La ATA La GAC La Lt La GGT TCA ATT L T AAT 
£1 TTA L T GTT L T AAA CTT TAT TGG GAA TTA L T GGA AAT AAA ACA GTA 
3^ ATA CCT GAA GAA Lc TTA TTT Lc CGA GAT TAC TCT AAA ATA GAT GAA 
.IS Lt Lt Lg Lt ATA CCT L T AAT AAT L T TTT GAT GTA AGA TTT TTT 
£1 AGC GCA GCC TGG GAA GAT GAA GAT TTA GAT ACT GAT AAT GAT L T ATT 
SS5 La GAT GCT TAT La AAA L T L C Lt L T ATC AAA GAT TCA ATT GCA 
GTA AAA TGG L T Lt L T TTT GCA GAA La GGA TAT La AAA TAT GTA 

k l t l t s» La La l t l c l t l l c l a l t l» l t TA t 

3 2 S^ caa aaa Lt La Lt Lt Lt Lt La Lt La aaa La ^aa La La 
S?S Lt Lt La Lt La La Lt La Lt Lt ^ga La Lt atg La aat 
3?S? La Lt La Lt Lt Lt La Lt Lt La Lt Lt La La La La 
All Lt Lt Lg Lt Lt La Lt Lt La Lt Lt La Lt La Lt La 
3^ La Lt Lt Lt Lt La Lt La Lt La Lt Lt Lt Lt La Lt 
,51 La loo Lt Lt Lc aca La Lt Lt La Lt Lt Lg La Lt Lt 
Ail Lt La La La Lg Lt Lt La La Lt La Lt La La La La 
• IS La Lt La Lt Lc Lt La La Lt Lt Lt Lt Lt Lt La Lt 

mdicate^by Tn asterisk! and the directed 
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Iota 10-L KDKENAIQWE K-21 

DT 144-S SSVEYINNWE Q-155 

ET 549-G GRLETILGWG N-560 

PT 17-D VFQNGFTAWG N-28 

C3 (C-003-9) 9-F TNVEEAKKWG N-20 

C3 (C-468) 9-FTNIDQAKAWG N-20 

ED IN 9-F TDLDEATKWG N-20 

consensus E/D X X X X W 

FIG. 4. Alignment of the la amino acid sequence with those 
involved in the NAD+ binding site of DT, ET, PT, C3 enzyme from 
C. botulinum C-003-9, from C. botulinum C-468, and from Staphy- 
lococcus EDIN. The conserved amino acids which have been 
assigned to cross-link the nicotinamide ring of NAD+ in DT, ET, 
PT, and the corresponding residues in the other sequences are 
shown in boldface. 




FIG. 3— Continued. 

from C. botulinum C-003-9 constitutes the adenine ring- 
binding domain of the NAD+ binding site (1). In this 
peptide, Glu-13 and Trp-18 align with Glu-148 and Trp-153 of 
diphtheria toxin (DT), Glu-553 and Trp-558 of pseudomonas 



exotoxin A (ET), and Trp-26 of pertussis toxin (PT), which 
have been assigned to cross-link the nicotinamide ring of 
NAD+ (4). In the C3 enzyme of C. botulinum C-468 (17) and 
the epidermal cell differentiation inhibitor (EDIN) of Staph- 
ylococcus aureus (8), which is a related enzyme, Glu-13 is 
replaced by the equivalent residue Asp-13 (Fig. 4). By 
comparing the la sequence with those of the other ADP- 
ribosylating toxins, Glu-14 and Trp-19 of la can be aligned 
with the above conserved residues (Fig. 4). As for C3, 
EDIN, and PT, these conserved residues in la are localized 
near the N terminus. 

(ii) lb sequence. The lb 875 deduced amino acids have a 
predicted molecular mass of 98,467 Da. The N-terminal 39 
amino acids form an hydrophobic domain with an N-termi- 
nal-charged residue (Lys-6) and probably correspond to a 
peptide signal. A potential cleavage site for signal peptidase 
(Lys-39 to Glu-40) is localized at the extremity of this 
putative signal peptide. The predicted precursor protein 
consists of 836 residues (94,013 Da) and is in agreement with 
that found for the Sb precursor (92,000 Da) (19). The 
N-terminal sequence of the naturally activated form of Sb 
determined by protein sequencing (Gly-Trp-Gly-Asp-Glu- 
Asp-Leu-Asp) matches amino acids Ala-212 to Asp-219. The 
two unrelated residues could correspond to differences in lb 
and Sb sequences or inaccuracies in protein sequencing. The 
deduced polypeptide from Ala-212 to Asn-875 (664 amino 
acids) is predicted to correspond to the functional proteolyt- 
ically activated form of lb or mature lb. The predicted 
molecular mass (80,890 Da) and pi (4.67) are in agreement 
with those determined experimentally for Sb (76,000 Da and 
pi 4.7) (19) and differ slightly from those found for lb (67,000 
Da and pi 4.2) (26). 

The propeptide (Glu-40 to Ala-211), which is excised from 
the lb precursor, consists of 172 amino acids, with a pre- 
dicted molecular mass of 19,883 Da and a pi of 4.62. 

Similarity between lb and PA. By comparison with protein 
sequences available in the data bank, lb displays significant 
homology with only the PA of the anthrax toxin complex 
(13). Overall 33.9% identity and 54.4% similarity have been 
found between both protein sequences. The central regions 
of these proteins show the maximum homology, 45% iden- 
tity and 70% similarity between amino acids 268 to 531 of lb 
and amino acids 260 to 536 of PA. 

Ib and PA present similar structures. Both contain a signal 
peptide (39 residues for lb and 29 residues for PA), a 
propeptide (19,883 Da for lb and 19,243 Da for PA), and a 
mature protein (80,000 Da for lb and 63,000 Da for PA) (13). 
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40 EEITNENTLsLgLMGYYFADEHFKDLELMAPIKNGDLKFEEKKV 

: : II.. I I ■ I I : I I I I . I : I . . : : . . ..III.: : 

30 EVKQENRLLNESESSSQGLLGYYFSDLNFQAPMVVTSSTTGDLSIPSSEL 

85 DKLLTEDNSSIKSIRWTGRI IPSEDGEYILST . DRNDVLMQINAKGDIAK 

80 EN.IPSENQYFQSAIWSGFIKVKKSDEYTFATSADNHVTMWVDDQEVINK 

134 T . . .LKVNMKKGQAYNIRIEIQDKNLGSIDNLSVPKLYWELNGNKTVIPE 

129 ASNSNKIRLEKGRLYQIKIQYQREN.PTEKGLDFKLYWTDSQNKKEVISS 

181 ENLFFRDYSKIDENDPFIPNNNFFDVRFFSAAWEDEDLDTDNDNIPDAYE 

178 DNLOLPELKOKSSNS R KKR STSAG . . PTVPDRDNDGIPDSLE 

I »-PA 63 

231 KNGYTIKDSIAVKWNDSF AEQGYKKYVSSYLESNTAGDPYTDYQK 

218 VEGYTVDVKNKRTFLSPWISNIHEKKGLTKYKSSPEKWSTASDPYSDFEK 
.ATP/GTP binding site 
276 ASGSIDKAIKLEARDPLVAAYEWGVGMENLIISTNEHASS_D ._. _ QGKT 

268 VTGRIDKNVSPEARHPLVAAYPIVHVDMENIILSKNEDQSTQNTDSETRT 

322 VSRATTNSKTDANTV. . GVSISAGYQNGFTGNITTSYSHTTDNSTAVQDS 

318 ISKNTSTSRTHTSEVHGNAEVHASF .FDIGGSVSAGFSNSNSSTVAIDHS 

370 NGESWNTGLSINKGESAYINANVRYYNTGTAPMYKVTPTTNLVLD. 

367 LSLAGERTWAETMGLNTADTARLNANIRYVNTGTAP I YNVLPTTSLVLGK 

415 GETLATIKAQDNQIGNNLSPNETYPKKGLSPLALNTMDQFNARLIP INYD 
. : I I I I I I I . : I I : : . I . I I : I I . I . I . I : I I I . I : I . . I . : I I : 
417 NQTLATIKAKENQLSQILAPNNYYPSKNLAPIALNAQDDFSSTPITMNYN 

465 QLKKLDSGKQIKLETTQVSGNYGTKNSQGQ . . IITEGNSWSNYISQIDSV 
I : . I : . . I I : : I : I . I I I I . : I I : . . : . . I . . I I : : . I I : . . 
4 67 QFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQET 

513 SASI ILDTGSQTF . ERRVAAKEQGNPEDKT . PEITIGEAIKKAFSATKNG 

517 TARIIFNGKDLNLVERRI AAVNPSDPLETTKPDMTLKEALKIAFGFNEPN 

561 ELLYFNGIP IDESCVELIFDDNTSEI IKEQLKYLDDKKI YNV KLER 

: I : . I . I . I . : : I I : . I I : I I : I I I : ... I I . I I I : 
567 GNLQYQGKDITE. . FDFNFDQQTSQNIKNQLAELNATNI YTVLDKIKLNA 

607 GMNILIKVPSYFTNFDEYNNFPASWSNIDTKNQDGLQSVANKLSGETKI I 

615 KMNILIRD KRFHYDRNN I AVGADESWKEAHREVINSSTE 

657 1PMSKLKPYKRYVFSGYSKDPSTSNSIT VNIKSKEQKTDYL 

655 GLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTF 

698 VPEKDYTKFSYEFETTGKDSSDIEITLTSSGVIFLDNLSITELNSTPEIL 

: I : . 1 : : . : . I I . . : I : . : 

705 I. . . DFKKY NDKLPLYISNPNYKVNVY AVT 

748 KEPEIKVPSDQEILDAHNKYYADIKLDTNTGNTYIDGIYFEPTQTNKEAL 

732 KENTIINPSEN GDTSTNGI KKIL 

798 DYIQK . YRVE 

755 IFSKKGYEIG 

FIG. 5. Alignment of lb and PA (top and bottom lines, respec- 
tively, of each pair of lines) amino acid sequences. The numbering 
begins at the start of the signal peptide of lb and PA. Arrows indicate 
the start of the propeptide of lb and PA (PA20) and the start of mature 
lb and PA (PA63). The furin cleavage site of PA is underlined. The 
predicted transmembrane segments are in boldface, and the predicted 
lb ATP/GTP binding site is indicated by a dashed line. Sequences 
were aligned by the GAP program of the Genetics Computer Group, 
University of Wisconsin. Symbols vertically connecting homologous 
residues in the two sequences are "I" for identical residues, ":" for 
highly similar residues, and "." for less similar residues. 



It has been shown that the cleavage between the propep- 
tide and mature PA is carried out by a cellular membrane 
protease with the specificity of furin at the site 164-Arg-Lys- 
Lys-Arg-167 (Fig. 5) (11). A consensus furin cleavage site 
has not been found in the lb sequence. 




A 8 
FIG. 6. Analysis of la expression in E. coli and C. perfringens. 
(A) SDS-PAGE (12% polyacrylamide) of protein extracted from E. 
coli harboring pMRP67 (lane 2) or pMRP91 (lane 3), of culture 
supernatant of C. perfringens (pMRP76) (lane 4), and of purified la 
from recombinant C. perfringens (pMRP76) (lane 5). Molecular 
mass standards are shown in lane 1. (B) Western blot with la 
antibodies. Lanes 2 to 5 are the same as described in panel A. 



The hydrophobic sequence Leu-292 to Ser-309 of lb is 
predicted to form a transmembrane segment by using the 
program described by Klein et al. (9). This segment shares a 
high degree of homology with a sequence found in PA: 
Leu-284 to Leu-300 (12 identical residues of 17) (Fig. 5). The 
hydrophobic PA segment is predicted to form an uncertain 
transmembrane segment (prediction by the quadratic func- 
tion but not by the linear function of the program of Klein et 
al. [9]). These data suggest that these sequences could 
constitute a common functional domain in lb and PA which 
could probably be involved in the translocation of the toxin 
across the cell membrane. 

An ATP/GTP binding [consensus GXXXXGK(TS)] (21) 
has been identified in the lb sequence from positions Ala-314 
to Thr-321 (ASSDQGKT) (Fig. 5). This motif has not been 
found in the PA sequence. 

Expression of la gene in recombinant E. coli and C. 
perfringens. E. coli TGI harboring the recombinant plasmid 
pMRP67 and C. perfringens S13 harboring pMRP76 (corre- 
sponding to the shuttle vector pJIR418 with the pMRP67 
insert cut by Sacl-Sall localized on the multiple cloning 
sequence) were analyzed for expression of the la gene by 
Western blotting (immunoblotting) and ADP-ribosylation of 
actin. As shown in Fig. 6, a 44-kDa peptide recognized by 
antibodies raised against native purified la was found in cell 
extracts of the recombinant E. coli and C. perfringens strains 
and in culture supernatant of C. perfringens harboring 
pMRP76. The affinity-purified la from the culture superna- 
tant of the recombinant C. perfringens exhibited the same 
electrophoretic mobility as the native Sa (data not shown). 

The pMRP91 corresponding to the subcloning into pA- 
CYC184 of the EcoRl-BclI fragment of pMRP67 also in- 
duced the production of the la component (Fig. 6). These 
results indicate that the promoter of the la gene is localized 
in the 245 nucleotides upstream of its start codon and that 
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FIG. 7. Autoradiogram of analysis of la expression by ADP-ribosylation of actin as described in Materials and Methods. Native Sa as the 
positive control (lane 1) and cellular extract and culture supernatant of recombinant C. perfringens (pMRP76) (lanes 4 and 5, respectively), 
of recombinant E. coli (pMRP67) (lanes 8 and 9, respectively), of C. perfringens control (pJIR418) (lanes 2 and 3, respectively), and of E. coli 
control (pUC19) (lanes 6 and 7, respectively) are shown. Molecular mass markers are indicated on the left. 



the 1,219 nucleotides upstream of the EcoRl site, in 
pMRP67, are not required for la gene expression. Although 
pMRP67 and pMRP76 contain a part of the lb gene (DNA 
fragment encoding the signal peptide, propeptide, and the 99 
N-terminal amino acids of the mature lb), no lb products, 
recognized by polyclonal antibodies against lb, were ob- 
served by Western blotting (data not shown). 

As shown in Fig. 7, cell extracts and culture supernatants 
of recombinant E. coli and C. perfringens strains exhibited 
ADP-ribosyltransferase activity with actin as the substrate. 
The enzymatic activity is greater in the cell extract of E. coli 
and the culture supernatant of C. perfringens than in the 
culture supernatant of E. coli and the cell extract of C. 
perfringens, indicating that the recombinant la is secreted by 
only C. perfringens. The time course of la production 
showed that la was released in C. perfringens culture 
supernatant early during the growth phase. In E. coli, 
however, low amounts were released in late growth phase, 
probably because of spontaneous cell lysis. 

Total la production (in cell lysate and culture supernatant) 
was compared in E. coli and C. perfringens by using a similar 
concentration of bacteria (10 8 bacteria per ml, counted in a 
Malassez chamber) in stationary growth phase. Taking into 
account that the copy number of pJIR418 in C. perfringens is 
15-fold less than that of pUC19 in E. coli (24), it can be 
estimated that C. perfringens produces 8-fold more la than 
E. coli. 

Expression of the lb gene in recombinant E. coli. The 
recombinant plasmid pMRP108 was constructed as shown in 
Fig. 8. This plasmid encompasses the putative promoter 
upstream from the la gene and the la and lb genes. As shown 
in Fig. 9, the cell extract of E. coli TGI harboring pMRP108 
exhibited 90- and 60-kDa peptides recognized by specific 
anti-Sb antibodies. In length, the 90-kDa peptide corre- 
sponded to the recombinant mature lb and its propeptide. 
But, the mature lb is not processed in E. coli TGI, indicating 
that the cleavage site between the propeptide and mature lb 
is not recognized by E. coli proteases. The la gene was also 
expressed in recombinant E. coli TGI harboring pMRP108 
as well as pMRP91 (data not shown). 



DISCUSSION 

We show that the two independent components la and lb 
of the iota toxin are encoded by two genes separated by 243 
noncoding nucleotides and preceded by a typical Shine- 
Dalgarno sequence. DNA sequence analysis suggests that 
the iota-toxin genes are under the control of the same 
promoter upstream of the la gene. A characteristic hydro- 
phobic signal peptide has been identified at the N termini of 
both la and lb sequences. This is in agreement with the fact 
that native iota toxin is exported from C. perfringens (27). 

The iota-toxin sequence seems to be closely related to that 
of C. spiroforme toxin. The protein sequencing of the C. 
spiroforme toxin peptides, which correspond to a total 
number of 48 residues, shows 83.4% identity with the 
deduced amino acid sequence of the iota toxin. It has not 
been excluded that the identity level is higher, since protein 
sequencing is not as accurate as DNA sequencing. These 
data are in agreement with the close immunological relation- 
ship found between both toxins (16, 19). 

The la gene has been expressed in E. coli and C. perfrin- 
gens under the control of its own promoter. The la expres- 
sion yield was higher in C. perfringens than in E. coli. The 
recombinant la was recognized by antibodies raised against 
native la and exhibited ADP-ribosyltransferase activity with 
actin as the substrate. Moreover, the recombinant la, in 
association with native Sb, induced morphological alter- 
ations of Vero cells as native iota toxin and C. spiroforme 
toxin did (data not shown). These data indicate that the la 
gene that we have identified corresponds effectively to the la 
structural gene. 

The lb gene was also expressed in E. coli, but the 
processing of the mature lb did not occur in E. coli. It seems 
that the la and lb genes form an operon under the control of 
a same promoter upstream from the la gene, since no 
promoter consensus sequence has been found upstream the 
lb gene. The apparent lack of expression of the truncated lb 
gene in pMRP67 and pMRP76 could be explained by an 
absence of epitopes recognized by the anti-Sb antibodies. 
Further work is need to precisely determine the transcription 
of these two genes. 
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FIG 8 Construction of recombinant plasmid for lb expression. The 3,351-bp EcoRI fragment encompassing the la gene and most of the 
'b gene was cloned from C. perfringens NCIB 10748 DNA into pUC19 (pMRP105). The £coRI site upstream from the la gene was deleted 
■ ■ ' " - - >a-base kit, yielding pMRP107. The £coRI insert of pMRP82 was cloned into the £coRl site of pMRP107 (pMRP108). The 



The la sequence shows two residues (Glu-14 and Trp-19) 
which can be aligned with amino acids involved in the 
NAD+ binding site of DT, ET, PT, and C3. The consensus 
motif (Glu/Asp)XXXX-Trp seems to be involved in the 
NAD+ binding site of these ADP-ribosylating toxins. How- 



ever, la and C3 enzymes do not show a site equivalent to 
His-21 of DT, which is conserved in the other ADP-ribosy- 
lating toxins and has been found to be essential for ADP- 
ribosylating activity (4). As the substrate of ADP-ribosyla- 
tion of la is an ATP-binding protein (actin) and a GTP- 





FIG. 9. Analysis of lb expression in E. coli. (A) SDS-PAGE (12% polyacrylamide) of E. coli TGI harboring pUC19 (lanes 1 and 2) or 
pMRP108 (lanes 3 and 4) (cell extract and culture supernatant, respectively) and of purified Sb (lane 5). (B) Western blot with specific anti-Sb 
antibodies. The lanes are the same as for panel A. 
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binding protein for the other ADP-ribosylating toxins, the 
NAD+ binding site could be slightly different in la. 

The lb protein can be divided in three domains (signal 
peptide, propeptide, and mature protein) denned by two 
protease cleavage sites. The presence of a propeptide lo- 
cated between the signal peptide and the mature protein is a 
common feature of most bacterial extracellular proteases, in 
which it is involved in the correct folding of the mature 
protein (30). But, no significant homology of lb with known 
bacterial protease sequences has been found. The epsilon 
toxin produced by C. perfringens types B and D is secreted 
as a protoxin (32 kDa), which is trypsin activated by cleav- 
age of 13 basic N-terminal amino acids (7). The propeptide of 
lb (172 amino acids) is much longer, and its release does not 
induce a significant pi change in the protein. 

Ib displays a similar structure and significant homology 
with the PA component of the anthrax toxin (13). Both 
proteins act as a binding component between cell surface 
receptors and the enzymatic component and trigger the toxin 
internalization allowing the delivery of the enzymatic com- 
ponent to its intracellular target. However, PA exhibits some 
modes of action different from those of Ib. The PA precursor 
binds to cell surface receptors (the cell recognition domain is 
located between residues 315 and 735) and is cleaved by a 
cell surface protease (furin). The propeptide is released, and 
the mature PA remains bound to the cell surface receptor 
and is able to bind either the enzymatic component edema 
factor (an adenylate cyclase) or lethal factor (a metallopro- 
tease) (10, 13). Residues 168 to 312 of PA encompasse the 
binding site for the edema factor and lethal factor (11, 13). In 
contrast, Ib needs to be proteolytically activated prior to 
being functional on cultured cells. These data suggest that 
the cell surface proteases are probably inactive on Ib. 
Moreover, the furin cleavage site (Arg-XX-Arg) has not been 
found in the Ib sequence. The cleavage site Ala-211 to 
Ala-212 between the propeptide and mature Ib, which has 
been determined by protein sequencing of the N terminus of 
mature Ib, corresponds to a common signal peptidase site 
(29). 

The predicted transmembrane segment (Leu-292 to Ser- 
308) of lb is highly homologous to the PA segment (Leu-284 
to Leu-300) and could be involved in the toxin translocation 
across the cell membrane. Other binary toxins, such as DT, 
present hydrophobic helices predicted to be transmembrane 
segments in their translocation domain (2). In addition, the 
Ib sequence displays an ATP or GTP binding site (Fig. 5), 
which could be nonfunctional. However, it is not known 
whether ATP is required for the entry into the cells of iota 
toxin. 

The relatedness between Ib and PA suggests that they 
could be derived from a common ancestor gene. Moreover, 
the GC contents of lb and PA genes (28 and 31 mol%, 
respectively) are nearly the same. The central domain impli- 
cated in the translocation of the toxin across the cell mem- 
brane corresponds to the most conserved region, suggesting 
that both iota and anthrax toxins probably use a common 
mechanism of cell entry. The molecular analysis of the other 
binary toxin genes will provide further information about the 
evolution of this toxin family. 
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